Developing a comprehensive pedagogical framework for pronunciation training based on blended learning and adapted Automatic Speech Recognition systems

نویسنده

  • Saandia Ali
چکیده

This paper reports on the early stages of a locally funded research and development project taking place in a French university. It aims at developing a comprehensive pedagogical framework for pronunciation training for adult learners of English. This framework will combine a direct approach to pronunciation training (Face-to-Face teaching) with online instruction using and adapting existing ASR systems. The sample of learners chosen for the study, are university students majoring in Arts, Literature or Communication at graduate and undergraduate level. These students have generally been learning English for 7 years prior to entering university and might show an advanced mastery of grammar and syntax, but their spoken English remain heavily accented and may hinder effective communication. Furthermore, the classes are overcrowded (up to 40 students per group) and the emphasis is placed on fluency and communication skills rather than phonetic accuracy. In addition to that, most teachers don’t feel confident with teaching pronunciation as they often haven’t received any training themselves. Under these circumstances, students experience performance anxiety, and they only have a limited amount of time for teacher-student interaction and individualised feedback. Language learning appears most efficient when the teacher constantly monitors progress to guide remediation or advancement. Computer Assisted Pronunciation Training programs (CAPT, Abuseleek 2007) could help realising these goals by offering individual practice and feedback in a safe environment. A considerable body of research has already shown the efficacy of ASR systems for pronunciation training (Hincks 2002, Kim 2006 or Elimat 2014). Recent ASR based CAPT programs include Subarashii (Entropic HTK recognizer), VILTS (SRI recognizer), FLUENCY (Carnegie Mellon University SPHINX recognizer), Naturally Speaking (Dragon Systems), and FluSpeak (IBM ViaVoice recognizer). We intend to build on these existing programs and on previous research to develop a set of tools to address bad pronunciation habits among French learners of English. This approach is based on intensive hybrid tuition, starting from teaching English phonology, pronunciation rules and contrastive analysis in the classroom and then developing online courses with embedded ASR systems for autonomous learning with automatic corrective feedback at both segmental (phone) and suprasegmental (intonation) level. Pronunciation is more cognitive than articulatory and we believe that such an approach could provide both a cognitive input so as to help students become more aware of their pronunciation habits and opportunities for practice and feedback. The ideal adapted ASR system should include error detection, scoring of pronunciation, full diagnosis (error visualisation and analysis) and remediation tools. It should also accommodate different levels of proficiency (priorities being set for each level and for each learner profile). Reading tasks and virtual conversations based on elicited language will be used for the online course. Finally, most CAPT systems present part of the feedback through the recording and visual representation of learners’ performance which is compared to the recording of a native speaker. Commonly used spectrograms will be enriched with modelled representations that are closer to learners’ perception using pitch modelling algorithms such as Momel (Modelling Melody, Espesser & Hirst 1993).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards the development of a comprehensive pedagogical framework for pronunciation training based on adapted automatic speech recognition systems

This paper reports on the early stages of a locally funded research and development project taking place at Rennes 2 university. It aims at developing a comprehensive pedagogical framework for pronunciation training for adult learners of English. This framework will combine a direct approach to pronunciation training (face-to-face teaching) with online instruction using and adapting existing Au...

متن کامل

The effectiveness of computer-based speech corrective feedback for improving segmental quality in l2 dutch

Although the success of automatic speech recognition (ASR)-based Computer Assisted Pronunciation Training (CAPT) systems is increasing, little is known about the pedagogical effectiveness of these systems. This is particularly regrettable because ASR technology still suffers from limitations that may result in the provision of erroneous feedback, possibly leading to learning breakdowns. To stud...

متن کامل

Let’s Take it to the Clouds: The Potential of Educational Innovations, Including Blended Learning, for Capacity Building in Developing Countries

In modern decentralised health systems, district and local managers are increasingly responsible for financing, managing, and delivering healthcare. However, their lack of adequate skills and competencies are a critical barrier to improved performance of health systems. Given the financial and human resource, constraints of relying on traditional face-to-face training to upskill a large and dis...

متن کامل

Explicit Pronunciation Training Using Automatic Speech Recognition Technology

A system is described, provisionally named Pronto, which uses automatic speech recognition (ASR) for training pronunciation of second languages in adult learners. The first version of Pronto was developed for native speakers of American English learning Spanish and for Mandarin Chinese speakers learning English. Pronto grows out of work in the Indiana Speech Training Aid (ISTRA) research progra...

متن کامل

Developing Speech Recognition and Synthesis Technologies to Support Computer-Aided Pronunciation Training for Chinese Learners of English

Copyright 2009 by Helen Meng Abstract. We describe ongoing research in the development of speech technologies that strives to raise the efficacy of computer-aided pronunciation training, especially for Chinese learners of English. Our approach is grounded on the theory of language transfer and involves a systematic phonological comparison between the primary language (L1 being Chinese) and seco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016